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Abstract: The advancements in the photogrammetry and remote sensing technologies has made it possible to extract 
useful tangible information from data which plays a pivotal role in various application such as management and mon- 
itoring of forests and agricultural lands etc. This study aimed to evaluate the effectiveness of spectral signatures for 
extraction of tea gardens from 1:5000 scaled digital orthophoto maps obtained from Rize city in Turkey. First, the 
normalized difference vegetation index (NDVI) was derived from the input images to suppress the non-vegetation 
areas. NDVI values less than zero were discarded and the output images was normalized in the range 0 - 255. Individ- 
ual pixels were then mapped into meaningful objects using global region growing technique. The resulting image was 
filtered and smoothed to reduce the impact of noise. Furthermore, geometrical constraints were applied to remove 
small objects (less than 500 pixels) followed by morphological opening operator to enhance the results. These objects 
served as building blocks for further image analysis. Finally, for the classification stage, a range of spectral values 
were empirically calculated for each band and applied on candidate objects to extract tea gardens. For accuracy as- 
sessment, we employed an area based similarity metric by overlapping obtained tea garden boundaries with the man- 
ually digitized tea garden boundaries created by experts of photogrammetry. The overall accuracy of the proposed 
method scored 89% for tea gardens from 10 sample orthophoto maps. We concluded that exploiting the spectral sig- 
natures using object based analysis is an effective technique for extraction of dominant tree species from digital or- 
thophoto maps. 


Keywords: Tree segmentation, tea garden classification, normalized difference vegetation index, morphological pro- 
cessing, region growing 


1. Introduction 

The advancements in the remote sensing technologies has made it possible to extract useful tangible information which 
plays a pivotal role in various application such as management and monitoring of forests, crops, agricultural land etc. 
(Zhou et al. 2016). High resolution multispectral imagery data is widely used for the aforementioned applications. 
Although multispectral imagery provides detailed information, nevertheless such details can introduce intra-class var- 
iations and inter-class similarities there by escalating the complexity of the problem at hand (Paula, et al. 2010; Qin 
2014). It requires extraction of highly discriminative features that could distinguish between various tree species. 

In recent years, photogrammetry and remote sensing techniques have been exploited for vegetation type classifica- 
tion for assessment of agricultural landscape (Pham, Brabyn, and Ashraf 2016). A variety of sensors are available to 
obtain high resolution images, with high spectral and spatial resolution: space borne and airborne multi-spectral im- 
agers, Synthetic Aperture Radar (SAR) and airborne Light Detection And Ranging (LiDAR) (Palenichka et al. 2013). 
The choice of sensor is very important and application specific. For example, to detect large vegetation areas such as 
tea gardens, a coarse resolution image can be used. On the other hand, if the objective is to detect fine scale objects, 
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such as individual trees, a high spectral resolution imagery data is more useful. Most of the studies have used visible 
and near infrared bands of the spectrum for tree specifies classification (Dalponte et al. 2013). Additionally, combining 
heterogeneous data obtained from various sensors is also useful as they provides complementary information about 
classes of interest (Bandyopadhyay, van Aardt, and Cawse-Nicholson 2013; Colgan et al. 2012; Sugumaran and Voss 
2007). 

A number of methods employed various indices for extraction of vegetation information from remote sensing data 
(Pena-Barragana et al. 2004). Traditionally, NDVI has widely been used to distinguish vegetation and non-vegetation 
areas (Adhikari et al. 2015). In addition, other vegetation indices have been proposed such as excessive green index 
(Qin 2014), greenness and brightness (Adhikari et al. 2015), ratios between spectral bands (Pena-Barragana et al. 
2004) etc. as a measure for vegetation information extraction. These methods exploit spectral reflectance information 
captured by sensors that characterizes the vegetation. 

Generally, the image analysis methods employ either pixels or objects as the basic unit of measurement. Many 
methods have employed pixel-based methods for image analysis (Deng et al. 2014). However, applying pixel-based 
methods on high spatial resolution images pose substantial challenges for image segmentation due to inter-class spec- 
tral variations. On the other side, object based image analysis (OBI A) methods employ objects as basic unit of meas- 
urement (Blaschke 2010). OBI A has received considerable attention in the recent era and it has widely been used for 
remote sensing applications (Colgan et al. 2012; Sugumaran and Voss 2007; Tzotsos and Argialas 2008; Walter 2004). 
It offers various advantages over traditional pixel based methods, e.g. it reduces the inter-class variations and further 
information can be derived from objects such as spatial and contextual information. These characteristics make OBI A 
a strong candidate for remote sensing image analysis. 

The objective of this research is to identifying tea gardens from 1:5000 scaled digital orthophoto maps obtained 
from Rize city in Turkey. The tea trees are small in size and appear in groups which offer a unique texture.. We 
exploited these characteristics to distinguish it from other types of vegetation. There are two primary objective of the 
proposed study: 1) to transform the raw pixels into a meaningful object representation using region growing technique. 
2) classification of tea gardens from rest of vegetation. Traditional NDVI index was employed to distinguish between 
vegetation and non-vegetation areas. Then, a regions growing technique was used to transform the pixels into an 
object-based representation. Some geometrical and morphological techniques were also applied to suppress the non- 
candidate objects. Finally, a spectral range filter was applied on the candidate objects to obtain the tea gardens. In 
order to improve further accuracy of the output, an optional manual editing step was also added in the final release of 
the software. The proposed method was implemented in MATLAB and Emgu CV. 


2. Study Area 


Rize is located in the eastern Black Sea part of Turkey (Coordinates: 41°01'29"N 40°31'20"E). The region is char- 
acterized by heavy rain and landscape which has both valleys and steep mountain ranges. The region is naturally more 
suitable for agriculture and forest and is popular for tea production. The high quality tea is not only distributed across 
the country but it is also exported to other countries in Europe and Asia. The area in Rize is mostly dense with various 
vegetation types, including tea and other types of vegetation. Moreover, build areas, roads, and bare land can also be 
seen in the images. Fig. 1 (a) shows the map of the Rize while (b) shows a sample image of a tea garden. 
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3. Materials and Method 

The aim of this study is to automatically extract the boundaries of tea garden from digital orthophoto maps. The 
images were preprocessed by applying Gaussian low pass filter to remove noise such as salt and pepper effects. NDVI 
was then derived from preprocessed images to obtain vegetation areas. Then, region growing technique was employed 
to transform the raw pixels into an object based representation. Geometrical and morphological operations were ap- 
plied to remove noisy elements, such as small objects, holes within objects etc. Finally, for classification a range filter 
was applied to the spectral signature to distinguish tea gardens from rest of vegetation and other types of land cover 
classes. The main workflow is divided into four sub-tasks (Fig. 2), namely: normalized difference vegetation index 
calculation, image segmentation, image filtering, and classification. The following sections describe each step in de- 
tails. 



Fig. 2: Workflow of the proposed method for tea garden extraction from digital orthophoto maps 


3.1 Dataset 

The data used for the current study was provided by EMI group, which was obtained on March 03, 2013 using 
ULTRACAM-X digital aerial camera. The flight was made over tea gardens in various regions in Rize city. The data 
was acquired to make assessments of the area covered by tea gardens in the region. It covered both visible and near 
infrared parts of the electromagnetic spectrum. The acquired images were then ortho-rectified and scaled to 1 :5000 so 
that direct measurement could be performed. Four spectral bands were used for the current study: red (R), green (G), 
blue (B) and near infrared (NIR). The characteristics of each band are summarized in Table 1 (Alvarez et al. 2010). 


Table 1 . Summary of band information 


Band Name 

Wavelength (nm) 

Blue 

400-600 

Green 

480- 660 

Red 

580-720 

Near Infrared 

620-1000 
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3.2 Normalize Difference Vegetation Index 


Since the images contained collection of heterogeneous objects in addition to tea gardens, such as other types of 
vegetation and manmade objects (houses, roads, forest etc.), therefore, naturally, the initial step was to distinguish 
vegetation from non-vegetation areas, and then proceed to distinguish tea gardens from rest of the vegetation. This 
preprocessing step offered two main advantages, (i) obtaining vegetation areas only (ii) reduction of processing time. 
Fig. 3 (a) shows the NDVI image obtained from original image while (b) shows the image after thresholding and 
normalization. 

For vegetation area extraction, the normalized difference vegetation index (NDVI) was derived from the input 
image. NDVI values are obtained by taking the ratio between difference and sum of NIR and R bands of the multi- 
spectral image (Jeevalakshmi, Reddy, and Manikiam 2016). Mathematically: 


NDVI = 


( NIR - R) 
( NIR + R) 


( 1 ) 


Normally, the values of NDVI lie in the range of -1 and +1. Higher values in the NDVI indicate the presence of 
high vegetation while lower values indicated less or non-vegetation. In our experiment, we only considered values 
greater than threshold value 0.2, which is an indicator for presence of green patches, while values less than the thresh- 
old value were discarded (i.e. set to 0). The resulting image was normalized to the range 0-255. 



Fig. 3 (a) NDVI image obtained from original image (b) Image after thresholding and normalization. 


3.3 Image Segmentation 

The main objective of image segmentation is to partition the image into a set of disjoint regions which are homo- 
geneous segments based on a similarity metric such as gray level, texture or color. We employed region based image 
segmentation as it is stable against noise and produces a thin boundary between different regions (Li et al. 2015). The 
algorithm examines each pixel and adds to the regions if the gray intensities are similar to the region. Otherwise, the 
pixel is discarded and set as background (intensity is set to 0). Moreover, we employed a thresholding technique for 
region growing based on the average gray value for the set of seed points and a user define threshold (t). 

We proceed to segment the image by selecting a set of seed points from the NDVI image. The user manually selects 
these seed points from the region of interest in the image. The gray values for these seed points are obtained and 
averaged to get their mean value ( m ). Moreover, a threshold value (t) is also defined by the user which adds an extra 
constraint so that only those pixels will be added to the region which are within the range of upper and lower con- 
straints obtained from mean and threshold values. To create homogeneous objects, the proposed algorithm then scans 
the whole image pixel by pixel from right to left and top to bottom and add each pixel to the region if it satisfy 


c\ if (m — t) < 7(x,y) < (m + t) 

g(x,y) = ( 0 otherwise 


(i) 
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Where g is the filtered image, I is the NDVI image and g{x,y ) and 7(x,y) represents intensities at the loca- 
tion^, y) for filtered and NDVI images respectively. Here, gis a binary image where foreground pixels are candidates 
for vegetation areas while background represents non-vegetation areas. These segments are then used as building 
blocks for classification described in the following sections. 


3.4 Image Filtering 

The image segmentation resulted in an object-based representation of the candidate tea garden objects. This repre- 
sentation, however, also contained noisy elements such as small objects and other isolated green patches. Naturally, 
these noisy objects should be removed first before proceeding to the next steps. To do so, we exploited morphological 
processing, applied geometrical constraints and hole filling procedures to suppress the non-candidate objects. Fig. 4 
shows the results obtained for each step. 

Mathematical morphological is a technique for analyzing the geometrical structure of images (Gu et al. 2016). The 
output image is obtained from morphological operation by comparing the corresponding pixels in the input image 
within a fixed neighborhood. The size and shape of neighborhood is defined by a template called structuring element. 
We employed morphological opening on the binary image obtained from segmentation step. Morphological opening 
performs erosion followed by dilation. Dilation tends to add pixels towards the boundaries of objects, while erosion 
tends to remove pixels along the boundaries. A square structuring element of size 5x5 was applied on the segmented 
image. This resulted in removing small gaps between objects and smoothing the object boundaries. 

A set of geometrical constraints were then applied to further refine output image. Connected components were 
calculated and bounding boxes were defined for each object. The geometrical constraints were then applied and the 
objects were removed with areas less than 500 pixels, width and height of the bounding box less than 200 pixels. This 
removed small objects which were not candidate tea garden objects. 

Finally, small holes were filled within the candidate tea garden objects which were not eliminated by the morpho- 
logical operation. To achieve this goal, the binary images were complemented and connected component were calcu- 
lated. Again, geometrical constraints were applied to remove objects whose areas were less than 100 pixels. To keep 
the candidate tea garden objects in white while background in dark color, the filtered image was complemented. 



Fig 4. Sample outputs obtained for each intermediate steps, (a) Regions growing image segmentation, (b) Morphological Opening, (c) 
Area based geometrical constraints applied to remove small regions and (d) Application of hole-filling to remove small holes within 
objects 
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3.5 Classification 


Taking spectral signatures into account, the final classification of objects into tea gardens and rest of land cover 
classes was obtained. A spectral range was empirically calculated by observing the spectral values for tea gardens. 
For each candidate object obtained from previous step, we calculated mean spectral values by using spatial contextual 
information. Then, final range filter was applied to classify the objects into either tea gardens or rest of land cover 
classes. An object is assigned to tea gardens if its mean spectral value lies in the specified range otherwise, it is 
discarded assuming it is a non- tea object. Fig. 5 (a) shows the results obtained for classification and (b) shows the 
boundaries for each tea gardens object. 



Obviously, the proposed method could distinguish between tea gardens and rest of non- vegetation areas with high 
accuracy. However, there were some difficulties in distinguishing tea gardens from other types of vegetation (forest, 
other types of trees) whose spectral signatures are similar to that of tea gardens. To deal with this situation, a manual 
editing step was also included with the software so that users can edit the miss classified objects. 


4. Experimental Results 

In this study, spectral signatures were employed for extraction of tea gardens from 1 :5000 scaled orthophoto maps. 
Four bands (red, green, blue, and near infrared) were used for the tea garden classification. Although various sophis- 
ticated methods have been proposed for evaluation e.g. (Whiteside, Maier, and Boggs 2014; Zhang 2001; Zylshal et 
al. 2016), yet we employed most commonly used area based accuracy assessment. Both classified and manually dig- 
itized images were first converted into binary images, with white color representing tea gardens while black color 
indicating background and then compared to find the accuracy of the propose method. We used the area-based method 
for accuracy assessment proposed by (Whiteside, Maier, and Boggs 2014). 

The experiments were performed on 10 images to evaluate the proposed method. Table 2 summarized the perfor- 
mance of the propose technique. The overall accuracy obtained was 89 %, user accuracy 88%, producer accuracy 
93%, false positive were 12% and false negatives were 7 %. Each obtained tea gardens from proposed method were 
compared with manually delineated tea gardens. To obtain overall accuracy, we summed up area of intersection for 
all images and divided it by total area of the ground truth images. 


Table 2. Area-based accuracy assessment 


Class 

OA(%) 

UA(%) 

PA(%) 

FP (%) 

FN(%) 

Tea Gardens 

89 

88 

93 

12 

7 
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Suppose, C represents the area of classified image obtained from proposed method and R be the area of manually 
digitized images (ground truth). C 0 R is the area of intersection between C and R , C n R represents the false pos- 
itive (FP), -i C n R represent false negatives (FN). Moreover, the producer’s accuracy (PA) and user’s accuracy (UA) 
were also calculated to assess the classification accuracy for each class. Finally, the overall accuracy ( OA ) was ob- 
tained by summing up the area of intersection for all images and dividing by the all ground truth area. The summary 
of the assessment measures are summarized in Table 3. 


Table 3. Area-based accuracy assessment 


Measure 

Formula 

User’s Accuracy 

\C r\R | 

|C| 

Producer’s Accuracy 

\C r\R | 

\R\ 

False Positives 

|Cn-,i? | 

\R\ 

False Negatives 

\— i c n R | 

\R\ 


In Fig. 5 the classification results are presented. It can be noticed that the proposed method was effective in identi- 
fying the tea gardens from 1:5000 scaled orthophoto maps. Fig. 5 (a) is the original image with 4 bands (R,G,B,NIR), 
(b) is the classified binary image, where white color represent tea gardens while black indicated rest of classes. In (c) 
binary image is overlaid on the original image to see the mapping clearly. For (d) the contours of each object in (b) is 
calculated and over laid on the original image for visual interpretations. 



(c) (d) 


Fig. 5 (a) Original digital orthophotomap (4 bands), (b) Binary image with white pixels showing tea gardens, (c) binary image 
overplayed on original image and (d) Contours drawn around tea gardens 
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The proposed methodology was implemented using MATLAB and Emgu CV. The experiments were run on a 
machine having RAM 8 GHz CPU. The computational complexity of the proposed method for various sizes of images 
is shown in Fig 6. It is observed that most of the processing time was consumed for image segmentation and classifi- 
cation tasks. The run time complexity for both algorithms is approximately 0(N 2 ). The algorithm can be accelerated 
by employing parallel programming techniques and GPU enabled processing. 



Image Size 

Fig. 6 Computational complexity of the proposed system. Plot of processing time against size of the input image. 


5. Conclusions 

Photogrammetry and remote sensing data is very important source of information that is used for tree species iden- 
tification over large geographical areas. We showed the usefulness of spectral signatures for tea garden extraction 
from 1 :5000 scaled digital orthophoto maps. NDVI was first derived from the orthophoto maps to distinguish between 
vegetation and non-vegetation areas. The raw pixels were then grouped into an object representation by exploiting 
both spatial and textural features. Then, geometrical constraints and morphological processing was performed to re- 
duce the noisy elements. Finally, a spectral signature based range filter was applied to extract tea gardens. The method 
presented here for tea garden extraction can also be adopted for other types of vegetation identification by only altering 
the model parameters. 

In future work, we plan to extend our method for land cover classification (water bodies, bare land, build areas) by 
including other type of vegetation indexes such as greenness, extended greenness index etc. Additionally, inclusion 
of a machine learning approach may be useful for classification stage. This may require derivation of new features 
from segmented objects and apply a feature selection approach to identify robust feature set that could be employed 
for classifying objects into respective classes. 
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